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Abstract 

In this paper, we studied the research areas of Chinese natural science basic re- 
search from a point view of complex network. Two research areas are considered to 
be connected if they appear in one fund proposal. The explicit network of such con- 
nections using data from 1999 to 2004 is constructed. The analysis of the real data 
shows that the degree distribution of the research areas network (RAN) may be 
better fitted by the exponential distribution. It displays small world effect in which 
randomly chosen pairs of research areas are typically separated by only a short path 
of intermediate research areas. The average distance of RAN decreases with time, 
while the average clustering coefficient increases with time, which indicates that the 
scientific study would like to be integrated together in terms of the studied areas. 
The relationship between the clustering coefficient C(k) and the degree k indicates 
that there is no hierarchical organization in RAN. 
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1 Introduction 



In the past few years there has been a growing interest in the study of com- 
plex networks. The boom has two reasons: the existence of interesting applica- 
tions in several biological, sociological, technological and communications sys- 
tems and the availability of a large amount of real data [1,2,3,4,5,6,7,8,9,10,11]. 
Recent works on the mathematics of networks have been driven largely by the 
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observed properties of actual networks and the studies on network dynamics[12, 13, 14,15, 16, 17,18, 19,20, S 
optimization[23,24,25,26,27,28], and evolutionary[29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46, 
It also makes sense to examine simultaneously data from different kinds of net- 
works. Recent approaches with methodology rooted in statistical physics focus 
on large networks, searching for universality both in the topology of the real 
networks and in the dynamics governing their evolution [49]. These combined 
theoretical and empirical results have opened unsuspected directions for re- 
searches and a wealth of applications in many fields ranging from computer 
science to biology and sociology [3,4,6,50,51]. In this respect, three important 
results have been crystallized: First, it has been found that the degree distri- 
bution contains important information about the nature of the network, for 
many large networks following the exponential distribution and the power-law 
distributions. Second, most networks have the so-called small world property 
[2], which means that the average distance between different pairs of nodes 
is rather small. Third, real networks display a degree of clustering coefficient 
higher than expected for random networks [2,4]. Finally, the assortative mix- 
ing is studied to answer why social networks are different from other types of 
networks [53]. 

The scientific studies can be considered as being organized within a network 
structure, which has a significant influence on the observed study collective 
behaviors. The viewpoints of complex networks are of interest in studying sci- 
entific study networks, to uncover the structural characteristics of the networks 
built on RAN. In this paper, the research areas of natural science basic re- 
search is studied from the point view of complex network [54]. In the fund man- 
agement department, such as National Natural Science Foundation of China 
(NSFC), the research areas are denoted by the code system, which have the 
tree structure to demonstrate the inclusion relation between the research ar- 
eas, such as Physics->statistical physics-> complex network. The leave codes 
of the code system always represent the research areas more specially. To make 
the network reflect the reality more accurately, in this paper, the nodes are 
defined as the leave nodes of the code system. Because the scientists can fill 
in the fund proposal two codes: the first application code and the second one, 
then if one requisition paper filled in two different codes one can consider 
that the research work is cross the two research areas. The proposals filled in 
only one code are not considered in RAN. By this definition, there are 371, 
349, 367, 400, 456, 544 nodes in RAN from 1999 to 2004. Three complemen- 
tary approaches allow us to obtain a detailed characterization. First, empirical 
measurements allow us to uncover the topological measures that characterize 
the network at a given moment, as well as the time evolution of these quanti- 
ties. Second, the average distance of RAN decreases with time, which means 
that the distance between any pairs of research areas is getting short. Third, 
the average clustering coefficient increases with time, which means that the 
neighbors of one research area would like to be connected with each other. 
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Fig. 1. The degree distribution of RAN from 1999 to 2004. 

This paper is organized as follows: In section 2, the topological character- 
istics of RAN, such as the degree distribution, clustering coefficient, average 
path length, assortative coefficient and the relationship between the clustering 
coefficient and the degree k are investigated and visualized. In section 3, the 
conclusion and discussion are given. 



2 Data Analysis of RAN 

In this section, the topology and dynamics of the empirical network are 
investigated. The parameters that are crucial to the understanding of the 
topology of RAN are extracted. The analysis of the data could provide the 
development trend of Chinese natural scientific basic research system. 

2. 1 Degree distribution follows the exponential form 

The degree distribution P(k) presents the probability that a randomly se- 
lected node has k links, which has been studied extensively for various net- 
works. Networks for which P(k) has a power-law tail, are known as scale-free 
networks [51,55]. On the other hand, classical network models, including the 
Erdds-Renyi model [56,57] and the Watts and Strogatz model [1] have an 
exponentially decaying P(k) and are collectively known as exponential net- 
works. Another observed degree distribution form of real-life networks [6, 58], 
named Stretched exponential distribution (SED) [59] is of the form P(x)dx = 
/i(a; ( ' M ~ 1 V^o)exp(— (x/x Q Y)dx and its accumulative distribution is P(x) = 
exp(—(x/x )^), which can be stated as InP(x) ~ x^ . Obviously, SED de- 
generates to an exponential distribution when \i is close to 1 and to a power 
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Fig. 2. The cumulation degree distribution of RAN from 1999 to 2004. 

law when \x is close to 0. When \x is between and 1, the degree distribution 
is between a power law and an exponential function. The data on a single- 
logarithmic plane show that the degree distribution of RAN can well fitted of 
an exponential form, see Fig. 1 and Fig. 2. 

It seems difficult to find a common function to fit the empirical network 
because of the fluctuations. However, the accumulative distribution of RAN 
has less fluctuations and is more stable. The distribution of all the data from 
1999 to 2004 can be fitted by an exponential distribution, given by P(k) = 
exp(0.13/c). The hub nodes of RAN and their degrees from 1999 to 2004 are 
demonstrated in Table I. 

Table 1 The hub nodes of RAN and their degrees from 1999 to 2004. 



Hub nodes degree 

1999 Computer-aided design 28 

2000 Intelligent information processing(IIP) 31 

2001 IIP 37 

2002 Pattern cognition, Artificial intelligence(PC & AI) 33 

2003 PC & AI 52 

2004 PC & AI 63 
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Fig. 3. The network diagram of 2004. The marked points are the research areas of 
Network theory and Artificial Intelligence. 

2.2 Average distance decreases with time 

The ability of two nodes, i and j, to communicate with each other depends 
on the length of the shortest path dij between them. The average of dij over 
all pairs named average distance D = Ar(j ^_ 1 ^ J2ij dij. 
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Fig. 4. Average distance of RAN. The separation is computed on the data up to the 
indicated year. 

Fig. 4 shows that D decreases with time, which is highly surprising because 
most network models so far predict that the average distance should increases 
with system size. The decreasing trend observed by us could have different 
origins. It is possible that the increasing interconnectivity decreases the aver- 
age distance. In analogy with the social networks, one can say that there are 
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3.8 degrees of separation between the research 

One can note the continuous declining of the average distance and the more 
connected nature of RAN fields expressed by a smaller separation. The mono- 
tone declining indicates that the scientists tend to study in crossing areas. 

2.3 Average Clustering coefficient increases with time 

The clustering coefficient C; can be defined as follows [1]: pick a node i that 
has links to fc, nodes in the system. If these h% nodes form a fully connected 
clique, there are ki(ki — l)/2 links between them, but in reality we find much 
fewer. Denote the number of links that connect the selected k{ nodes to each 
other by Ei. The clustering coefficient for node i is then Cj = 2Ei/[ki(ki — 
1)]. The average clustering coefficient for the whole network is obtained by 
averaging over all nodes in the system C = jqY^f=\Ci. In simple terms, 
the clustering coefficient of RAN indicates the probability that two different 
nodes connect to the same node. The data of RAN indicates average clustering 
coefficient decrease with time, which can be seen from Fig. 5. 
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Fig. 5. Average clustering coefficient of RAN, determined for the data up to the 
indicated year. 

2.4 Average degree increases 

The number of nodes of RAN increases with time because of the arrival 
of new research areas. The growth of total number of links comes from two 
parts. One is the connections generated by new research areas with old ones, 
the other is the new connections between old nodes. A quantity characterizing 
the network's interconnections is the average degree (k), denoting the average 
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number of links per node. The time dependence on (k) of RAN is shown in 
Fig. 6. 
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Fig. 6. Average number of links per node for RAN. Results are computed on the 
data up to the given year. 



2.5 Assortative Coefficient 

The assortative coefficient A measures the tendency of a network to connect 
vertices with the same or different degrees [52,53]. If A > 0, the network is 
said to be assortative, non-assortative when A = and disassortative when 
A < 0. A is defined as follows 

A = -^J2J2( e ik-QjQk), (1) 

a Q 3 k 



Table 2 Temporal evolution of some quantities of RAN. In the first 
column, v is the number of vertices, k is the mean connectivity, C is the 
clustering coefficient, and D is the average shortest path length 





1999 


2000 


2001 


2002 


2003 


2004 


V 


371 


349 


367 


400 


456 


544 


k 


4.264 


4.768 


4.986 


5.235 


6.197 


6.379 


C 


0.097 


0.113 


0.107 


0.121 


0.154 


0.197 


D 


5.367 


4.806 


4.525 


4.273 


4.060 


3.765 


isolate node 


18 


29 


19 


14 


7 


8 


A 


-0.0187 


0.0033 


0.0796 


-0.0084 


-9.7050e-004 


-0.0899 
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where e^ is the probability that a randomly chosen edge has vertices with 
degree j and k at either end, q% = J2j e jk and a 2 = J2kk 2 qk — (J2kk<lk) 2 - 
The possible values of A lies in the interval — 1 < A < 1. Fig. 7 shows that 
the assortative coefficient begin to decline from 2001 and it decreases to the 
negative value from 2002. This indicates that RAN tend to be disassortative 
network. 




1999 2000 2001 2002 2003 2004 

Year 



Fig. 7. Assortative coefficient of RAN. Results are computed on the data up to the 
given year. 

2.6 Dependence between clustering coefficient and degree 

This intrinsic hierarchy can be characterized in a quantitative manner us- 
ing the recent findings of Dorogovtsev et. al [45] and Ravasz et. al [46]. The 
hierarchical organization, implying that small groups of nodes organize in a 
hierarchical manner into increasingly large groups. In RAN, the relationships 
between clustering coefficient and the degree of clustering characterizing from 
1999 to 2004 are demonstrated in Fig. 8. 

From Fig. 8, one can see that there are no dependence between clustering 
coefficient and k from 1999 to 2004, which means that there is no hierarchical 
organization in RAN. 



3 Conclusion and Discussion 

In this paper, we have constructed a research areas network and pointed out 
that such a network falls into the exponential distribution and small-world 
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Fig. 8. Dependence between clustering coefficient and k from 1999 to 2004. 

networks categories. With the observation on the section 2, RAN tend to be 
a disassortative network. The average distance of RAN decreases with time, 
while the clustering coefficient increases with time. This indicates that the 
research areas are tend to be crossed studied by the scientists. The dependence 
between clustering coefficient and k indicates that there is no hierarchical 
organization in RAN from 1999 to 2004. 

The characteristics of RAN may lies in the following reasons. 

1. The code system is changeless in a period of time, but the scientific study 
develops very fast, there are new research subjects emerge everyday So some 
scientists can not find the suitable code which could describe their research 
content but only fill in the proposal two corelational codes. 

2. The research subjects become more complexity, which can not be solved 
by one or two fields of knowledge. So the contents actuate the scientists who 
studied them to fill in their proposal two codes. 

We hope that the work presented here may stimulate further research on this 
subject. Some opened questions are, for instance, whether the results obtained 
by RAN held for different countries? What is the evolving mechanism of RAN? 



Acknowledgement 



The authors are grateful to Dr. Qiang Guo, Bing Wang, Zhong-Zhi Zhang 
and Tao Zhou for their valuable comments and suggestions, which have led to 
a better presentation of this paper. This work has been supported by the Na- 
tional Science Foundation of China under Grant Nos. 70431001 and 70271046. 



9 



References 



[1] D. J. Watts and S. H. Strogatz, Nature 393 (1998) 440. 

[2] S. N. Dorogovtsev and J. F. F. Mendes, Evolution of Networks (Oxford University 
Press, Oxford, 2003). 

[3] P. S. Dodds, R. Muhamad and D. J. Watts, Science 301 (2003) 827. 

[4] R. N. Onody and P. A. Castro, Phys. Rev. E 70 (2004) 037103. 

[5] T. Xu, J. -C. Chen, Y. He and D. -R. He, Int. J. of Mod. Phys. B 18 (2004) 2599. 

[6] P. -P. Zhang, K. Chen, Y. He, T. Zhou, B. -B. Su, Y. -D. Jin, H. Chang, Y. -P. 
Zhou, L. -C. Sun, B. -H. Wang and D. -R. He, Physica A 359 (2005) 835 . 

[7] R. Albert and A.-L. Barabasi, Rev. Mod. Phys. 74 (2002) 47. 

[8] S. N. Dorogovtsev and J. F. F. Mendes, Adv. Phys. 51 (2002) 1079. 

[9] M. E. J. Newmann, SIAM Rev. 45 (2003) 167. 

[10] X. F. Wang, Int. J. Bifurcat. Chaos 12 (2002) 885. 

fill X. F. Wang, G. Chen, IEEE Circuits Systems Mag. 3 (2003) 6. 

[12] B. Tadic, S. Thurner, and G. J. Rodgers, Phys. Rev. E 69 (2004) 036102. 

[13] L. Zhao, Y. -C. Lai, K. Park and N. Ye, Phys. Rev. E 71 (2005) 026125. 

[14] G. Yan, T. Zhou, B. Hu, Z. -Q. Fu and B. -H. Wang, arXiv: |conclmat /050 5366 

[1 5] C. -Y. Yin, B. -H. Wang, W. -X. Wang, T. Zhou, and H. -J. Yang, arXiv: 
|physics/050620'4l 



[16] R. Pastor-Satorras and A. Vespignani, Phys. Rev, Lett. 86 (2001) 3200. 

[17] G. Yan, T. Zhou, J. Wang, Z. -Q. Fu and B. -H. Wang, Chin. Phys. Lett. 22 
(2005) 510. 

[18] T. Zhou, Z. -Q. Fu, and B. -H. Wang, arXiv: physics/0508096 

[19] A. E. Motter and Y. -C. Lai, Phys. Rev. E 66 (2002) 065102. 

[20] K. -I. Goh, D. -S. Lee, B. Kahng and D. Kim, Phys. Rev. Lett. 91 (2003) 148701. 

[21] T. Zhou and B. -H. Wang, Chin. Phys. Lett. 22, (2005) 1072. 

[22] T. Zhou, B. -H. Wang, P. -L. Zhou, C. -X. Yang and J. Liu, Phys. Rev. E 72 
(2005) 016139. 

[23] A. X. C. N. Valente, A. Sarkar and H. A. Stone, Phys. Rev. Lett. 92 (2004) 
118702. 

[24] G. Paul, T. Tanizawa, S. Havlin and H. E. Stanley, Eur. Phys. J. B 38 (2004) 
187. 



10 



[25; 
[26; 
[27; 
[28; 

[29 
[30 
[31 
[32 
[33 
[34 
[35 

[36 

[37; 
[38; 

[39 
[40 
[41 

[42; 

[43; 
[44; 
[45; 

[46; 

[47 



S. N. Dorogvtsev and J. F. F. Mendes, Europhys. Lett. 52 (2000) 33. 
J. Kleinberg, Nature 406 (2000) 845. 

M. A. de Menezes, C. Moukarzel and T. J. P. Penna, Europhys Lett. 50 (2000) 
574. 

C. F. Moukarze, Phys. Rev. E 60 (1999) 6263. 

T. Zhou, B. -H. Wang, P. -M. Hui and K. -P. Chan, arXiv: |cond"-mat /0405258| 



B. Wang, H. -W. Tang, Z. -L. Xiu, C. -H. Guo and T. Zhou, |cond-inat7 050971 1 

B. Wang, H. -W. Tang, C. -H. Guo and Z. -L. Xiu, |cond-mat/0506725| 

J. -G. Liu, Z. -T. Wang and Y. -Z. Dang, Mod. Phys. Lett. B 19 (2005) 785. 



J. -G. Liu, Z. -T. Wang and Y. -Z. Dang, cond-mat/0509290 

S. H. Strogatz, Nature 410 (2001) 268. 

M. E. J. Newman, J. Stat. Phys. 101 (2000) 819. 

X. Li, Y. -Y. Jin and G. Chen, Physica A 328 (2003) 287. 



R. Kasturirangan, arXiv: cond-mat/9904055 . 



T. Zhou, G. Yan and B. -H. Wang, Phys. Rev. E 71 (2005) 046141. 

Z. -Z. Zhang, F. Cornelias, G. Fertin and L. -L. Rong, arXiv:cond-mat/0503316 . 

B. Wang, Z. -Z. Zhang, H. -W. Tang and Z. -L. Xiu, arXiv:cond-mat7 0509022 . 

J. S. Andrade, J. H. Hermann, R. F. S. Andrade and L. R. da Silva, Phys. Rev. 
Lett. 94 (2005) 018702. 

W. -X. Wang, B. Hu, T. Zhou, B. -H. Wang and Y. -B. Xie, Phys. Rev. E 72 
(2005) 016140. 

F. Cornelias, G. Fertin and A. Raspaud, Phys. Rev. E 69 (2004) 037104. 
F. Cornelias and M. Sampels, Physica A 309 (2002) 231. 

S. N. Dorogovtsev, A. D. Goltsev and J. F. F. Mendes, Phys. Rev. E 65 (2002) 
066122. 

E. Ravasz and A. -L. Barabasi, Phys. Rev. E 67 (2003) 026112 . 

Z. -M. Gu, T. Zhou, B. -H. Wang, G. Yan, C. -P. Zhu and Z. -Q. Fu, 
arXiv:cond-mat/0505175 . 



[48] P. -Q. Jiang, B. -H. Wang, T. Zhou, Y. -D. Jin, Z. -Q. Fu, P. -L, Zhou and X. 
-S. Luo, Chin. Phys. Lett. 22 (2005) 1285. 

[49] A. -L. Barabdsi, H. Jeong, Z. Neda, E. Ravasz, A. Schubert and T. Vicsek, 
Physica A 311 (2002) 590. 



11 



[50] X. Li, Y. -Y. Jin and G. Chen, Physica A 343 (2004) 573. 

[51] A. -L. Barabasi and R. Albert, Science 286 (1999) 509. 

[52] M. E. J. Newman, Phys. Rev. Lett. 89 (2002) 208701. 

[53] M. E. J. Newman, J. Park, Phys. Rev. E 68 (2003) 036122. 

[54] Data were gathered from the fund management deparment of China. 

[55] A. -L. Barabasi, R. Albert and H. Jeong, Physica A 272 (1999) 173. 

[56] P. Erdos and A. Renyi, Publ. Math. 6 (1959) 290. 

[57] P. Erdos and A. Renyi, Publ. Math. Ins. Hung. Acad. Sci. 5 (1960) 17. 

[58] T. Zhou, Y. -D. Jin, B. -H. Wan g, D. -R. He, P. -P. Zhang, Y. He, B. -B. 
K. Chen and Z. -Z. Zhang, arXiv: |cond-mat/0502253| 

[59] J. Laherrere and D. Sornette, Eur. Phys. J. B 2 (1998) 525. 



12 



